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SEQUENCE LISTING 

<110> Lassen, Soren Flensted 

<120> Improved proteases and methods for producing them 

<130> 10495. 204-US 

<160> 53 

<170> Patentln version 3.3 

<210> 1 

<211> 1062 

<212> DNA 

<213> Nocardiopsis sp. NRRL 18262 



<220> 

<221> misc_feature 
<222> (1) . . (495) 

<223> Encodes the pro-region shown in positions -165 to -1 of SEQ ID 
NO: 43. 

<220> 

<221> misc_feature 
<222> (496) . . (1059) 

<223> Encodes the mature region shown in positions 1-188 of SEQ ID 
NO: 43. 

<400> 1 



gctactggag 


cattacctca 


gtctcctaca 


cctgaagcag 


atgcagtatc 


gatgcaagaa 


60 


gcattacaac 


gtgatcttga 


tcttacatca 


gctgaagctg 


aggaattact 


tgctgcacaa 


120 


gatacagcct 


ttgaagttga 


tgaagctgcc 


gctgaagcag 


ctggtgatgc 


atatggtggt 


180 


tcagtattcg 


atactgaatc 


actcgaactt 


actgtactag 


tgaccgatgc 


agcagctgtt 


240 


gaagctgttg 


aagccacagg 


tgcaggtaca 


gagctcgtat 


cttatggtat 


tgatggatta 


300 


gatgagatcg 


tacaagagct 


taatgcagct 


gatgccgttc 


caggtgtagt 


tggatggtat 


360 


cctgatgtag 


caggtgatac 


tgttgtctta 


gaagttcttg 


aaggctctgg 


agctgatgtt 


420 


tctggacttt 


tagcagacgc 


aggagtcgat 


gcatccgcgg 


ttgaagtgac 


cacgtcagat 


480 


cagcctgaac 


tctatgccga 


tatcattgga 


ggcctagcgt 


acacaatggg 


tggtcgctgc 


540 


agcgtaggat 


ttgcagccac 


aaatgcagct 


ggacaacctg 


gcttcgtgac 


agctggacat 


600 


tgcggccgcg 


tcggtacaca 


ggttactatc 


ggcaatggaa 


gaggtgtctt 


tgagcaaagc 


660 


gtatttcccg 


ggaatgatgc 


tgccttcgtt 


agaggtacgt 


ccaactttac 


gcttactaac 


720 


ttagtatcta 


gatacaacac 


tggcggatat 


gcaactgtag 


caggtcacaa 


tcaagcacct 


780 
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attggctcta gcgtctgccg ctcagggtcg actacaggat ggcattgtgg aaccattcaa 840 

gctagaggtc agagcgtgag ctatcctgaa ggtaccgtaa cgaacatgac tcgtacgact 900 

gtatgtgcag aaccaggtga ctctggaggt tcatatatca gcggtacgca agcgcaaggc 960 

gttacctcag gtggatccgg taactgtagg acaggtggca caacgttcta ccaggaagtg 1020 

acaccgatgg tgaactcttg gggagttaga ctccgtacat aa 1062 

<210> 2 
<211> 1143 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> A synthetic 10R gene (10Rsynt-15) encoding a S2A protease denoted 
"10R" fused by PCR in frame to the signal peptide encoding 
sequence of a heterologous protease, Savinase. 



<400> 2 
atgaagaaac 


cgttggggaa 


aattgtcgca 


agcaccgcac 


tactcatttc 


tgttgctttt 


60 


agttcatcga 


tcgcatcggc 


tgctactgga 


gcattacctc 


agtctcctac 


acctgaagca 


120 


gatgcagtat 


cgatgcaaga 


agcattacaa 


cgtgatcttg 


atcttacatc 


agctgaagct 


180 


gaggaattac 


ttgctgcaca 


agatacagcc 


tttgaagttg 


atgaagctgc 


cgctgaagca 


240 


gctggtgatg 


catatggtgg 


ttcagtattc 


gatactgaat 


cactcgaact 


tactgtacta 


300 


gtgaccgatg 


cagcagctgt 


tgaagctgtt 


gaagccacag 


gtgcaggtac 


agagctcgta 


360 


tcttatggta 


ttgatggatt 


agatgagatc 


gtacaagagc 


ttaatgcagc 


tgatgccgtt 


420 


ccaggtgtag 


ttggatggta 


tcctgatgta 


gcaggtgata 


ctgttgtctt 


agaagttctt 


480 


gaaggctctg 


gagctgatgt 


ttctggactt 


ttagcagacg 


caggagtcga 


tgcatccgcg 


540 


gttgaagtga 


ccacgtcaga 


tcagcctgaa 


ctctatgccg 


atatcattgg 


aggcctagcg 


600 


tacacaatgg 


gtggtcgctg 


cagcgtagga 


tttgcagcca 


caaatgcagc 


tggacaacct 


660 


ggcttcgtga 


cagctggaca 


ttgcggccgc 


gtcggtacac 


aggttactat 


cggcaatgga 


720 


agaggtgtct 


ttgagcaaag 


cgtatttccc 


gggaatgatg 


ctgccttcgt 


tagaggtacg 


780 


tccaacttta 


cgcttactaa 


cttagtatct 


agatacaaca 


ctggcggata 


tgcaactgta 


840 


gcaggtcaca 


atcaagcacc 


tattggctct 


agcgtctgcc 


gctcagggtc 


gactacagga 


900 


tggcattgtg 


gaaccattca 


agctagaggt 


cagagcgtga 


gctatcctga 


aggtaccgta 


960 


acgaacatga 


ctcgtacgac 


tgtatgtgca 


gaaccaggtg 


actctggagg 


ttcatatatc 


1020 


agcggtacgc 


aagcgcaagg 


cgttacctca 


ggtggatccg 


gtaactgtag 


gacaggtggc 


1080 



2 



acaacgttct accaggaagt gacaccgatg gtgaactctt ggggagttag actccgtaca 1140 
taa 1143 



<210> 3 

<211> 8 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Oterminal amino acid tail expressed as fusion to protease of the 
invention. 

<400> 3 

Gin Ser His Val Gin Ser Ala Pro 
1 5 



<210> 4 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polynucleotide encoding a C-terminal amino acid tail expressed as 
fusion to protease of the invention. 

<400> 4 

caatcgcatg ttcaatccgc tcca 24 



<210> 5 

<211> 4 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> C-terminal amino acid tail expressed as fusion to protease of the 
invention. 



<400> 5 



Gin Ser Ala Pro 
1 



<210> 6 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polynucleotide encoding a C-terminal amino acid tail expressed as 
fusion to protease of the invention. 



3 



<400> 6 
caatcggctc ct 



12 



<210> 7 

<211> 2 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> C-terminal amino acid tail expressed as fusion to protease of the 
invention. 

<400> 7 

Gin Pro 
1 



<210> 8 

<211> 6 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polynucleotide encoding a C-terminal amino acid tail expressed as 
fusion to protease of the invention. 

<400> 8 

caacca 6 



<210> 9 

<211> 1 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> C-terminal amino acid tail expressed as fusion to protease of the 
invention. 

<400> 9 

Pro 
1 



<210> 10 

<211> 3 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polynucleotide encoding a C-terminal amino acid tail expressed as 
fusion to protease of the invention. 



4 



<400> 



10 



cca 



3 



<210> 11 

<211> 45 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #252639 

<400> 11 

catgtgcatg tgggtaccgc aacgttcgca gatgctgctg aagag 45 

<210> 12 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #251992 



<210> 13 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179541 

<400> 13 

gcgttgagac gcgcggccgc gagcgccgtt tggctgaatg atac 4 4 

<210> 14 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179542 



<400> 12 

catgtgcatg tggtcgaccg attatggagc ggattgaaca tgcg 



44 



<400> 14 

gcgttgagac agctcgagca gggaaaaatg gaaccgcttt ttc 



43 



<210> 15 
<211> 64 
<212> DNA 



<213> Artificial sequence 



5 



<220> 

<223> Primer #179539 



<400> 15 

ccatttgatc agaattcact ggccgtcgtt ttacaaccat tgcggaaaat agtcataggc 



60 



atcc 



64 



<210> 16 

<211> 60 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179540 

<400> 16 

ggatccagat ctggtacccg ggtctagagt cgacgcggcg gttcgcgtcc ggacagcaca 60 

<210> 17 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179154 



<210> 18 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179153 

<400> 18 

ccgcgtcgac actagacacg ggtacctgat ctagatc 37 

<210> 19 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #317 



<400> 17 

gttgtaaaac gacggccagt gaattctgat caaatgg 



37 



<400> 19 

tggcgcaatc ggtaccatgg gg 



22 
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<210> 20 

<211> 40 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #139 NotI 

<400> 20 

catgtgcatg cggccgcatt aacgcgttgc cgcttctgcg 40 

<210> 21 
<211> 7443 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Sequence of plasmid pMB1508 
<400> 21 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgataaaagt gctttttttg 420 

ttgcaattga agaattatta atgttaagct taattaaaga taatatcttt gaattgtaac 480 

gcccctcaaa agtaagaact acaaaaaaag aatacgttat atagaaatat gtttgaacct 540 

tcttcagatt acaaatatat tcggacggac tctacctcaa atgcttatct aactatagaa 600 

tgacatacaa gcacaacctt gaaaatttga aaatataact accaatgaac ttgttcatgt 660 

gaattatcgc tgtatttaat tttctcaatt caatatataa tatgccaata cattgttaca 720 

agtagaaatt aagacaccct tgatagcctt actataccta acatgatgta gtattaaatg 780 

aatatgtaaa tatatttatg ataagaagcg acttatttat aatcattaca tatttttcta 840 

ttggaatgat taagattcca atagaatagt gtataaatta tttatcttga aaggagggat 900 

gcctaaaaac gaagaacatt aaaaacatat atttgcaccg tctaatggat ttatgaaaaa 960 

tcattttatc agtttgaaaa ttatgtatta tggagctctg aaaaaaagga gaggataaag 1020 

aatgaagaaa ccgttgggga aaattgtcgc aagcaccgca ctactcattt ctgttgcttt 1080 
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tagttcatcg 


atcgcatcgg 


ctgctgaaga 


agcaaaagaa 


aaatatttaa 


ttggctttaa 


1140 


tgagcaggaa 


gctgtcagtg 


agtttgtaga 


acaagtagag 


gcaaatgacg 


aggtcgecat 


1200 


tctctctgag 
zu j 


gaagaggaag 

zz? zj 3 3 zj 


tcgaaattga 

3 3 


attgcttcat 


gaatttgaaa 


cgattcctgt 


1260 


tttatccgtt 


gagttaagcc 


cagaagatgt 


ggacgcqctt 

3 3 ZJ ZD 


gaactcgatc 


cagegattte 


1320 


ttatattgaa 


qaqqatqcaq 


aagtaacgac 


aatggcgcaa 


teggtaccat 


qaaatatatc 

ZDZZ7ZDZD ^ 


1380 


aacgcgttaa 


tccgcggata 


tatagcggcc 


qcaaatctaq 


gaccaataat 


aatgactaga 


1440 


gaagaaagaa 


tgaagattgt 


tcatgaaatt 


aaggaacgaa 


tattggataa 


agtgggatat 


1500 


ttttaaaata 


tatatttatg 


ttacagtaat 


attgactttt 


aaaaaaggat 


tgattctaat 


1560 


gaagaaagca 


gacaagtaag 


cctcctaaat 


tcactttaga 


taaaaattta 


ggaggcatat 


1620 


caaatgaact 


ttaataaaat 


tgatttagac 


aattggaaga 


gaaaagagat 


atttaatcat 


1680 


tatttgaacc 


aacaaacgac 


ttttagtata 


accacagaaa 


ttgatattag 


tgttttatac 


1740 


cgaaacataa 


aacaagaagg 


atataaattt 


taccctgcat 


ttattttctt 


aqtqacaaqq 

3 3^ l ^^ l ^ l 33 


1800 


gtgataaact 


caaatacagc 


ttttagaact 


ggttacaata 


gegaeggaga 


gttaggt tat 


1860 


tgggataagt 


tagagccact 


t tatacaat t 


t t tqatqqtq 


t atctaaaac 


attctctggt 


1920 


atttggactc 


ctgtaaagaa 


tgacttcaaa 


gagttttatg 


atttatacct 


ttctgatgta 


1980 


gagaaatata 


atqqttcqqq 

^3 3 ^^333 


gaaattgttt 


cccaaaacac 


ctatacctga 


aaatgctttt 


2040 


tctctttcta 


ttattccatg 


gacttcattt 


actgggttta 


acttaaatat 


caataataat 


2100 


agtaattacc 


t tctacccat 


tat t acagca 


ggaaaat tea 


ttaataaagg 


taat tcaata 


2160 


tatttaccgc 


tatctttaca 


ggtacatcat 


tctgtttgtg 


atggttatca 


tacaqqattq 


2220 


tttatgaact 


ctattcagga 


attgtcagat 


aggectaatg 


actggctttt 


ataatatgag 


2280 


ataatgccga 


ctgtactttt 


tacagtcggt 


tttctaacga 


tacattaata 


ggtacgaaaa 


2340 


agcaactttt 


tttgcgctta 


aaaccagtca 


taccaataac 


t taagggtaa 


ct agcctcgc 


2400 


cqqaaaqaqc 


gaaaatgcct 


cacatttgtg 


ccacctaaaa 


aqqaqeqatt 


tacatatgag 


2460 


ttatgcagtt 


tgtagaatgc 


aaaaagtgaa 


atcagctgga 


ctaaaaqqqq 

3333 


ccgcagagta 


2520 


gaatggaaaa 


qqqqatcqqa 


aaacaagtat 


ataqqaqqaq 


acctatttat 


ggct t cagaa 


2580 


aaagacgcag 


gaaaacagtc 


agcagtaaag 


cttgttccat 


tgettattae 


tgtcgctgtg 


2640 


ggactaatca 


tctggtttat 


tcccgctccg 


teeggacttg 


aacctaaagc 


ttggcatttg 


2700 


tttgcgattt 


ttgtcgcaac 


aattatcggc 


tttatctcca 


agcccttgcc 


aatgggtgca 


2760 


attgcaattt 


ttgcattggc 


ggttactgca 


ctaactggaa 


cactatcaat 


tgaggataca 


2820 



ttaagcggat 


tcgggaataa 


gaccatttgg 


cttatcgtta 


tcgcattctt 


tatttcccgg 


2880 


ggatttatca 


aaaccggtct 


cggtgcgaga 


atttcgtatg 


tattcgttca 


gaaattcgga 


2940 


aaaaaaaccc 


ttggactttc 


ttattcactg 


ctattcagtg 


atttaatact 


ttcacctgct 


3000 


attccaagta 


atacggcgcg 


tgcaggaggc 


attatatttc 


ctattatcag 


atcattatcc 


3060 


gaaacattcg 


gatcaagccc 


ggcaaatgga 


acagagagaa 


aaatcggtgc 


attcttatta 


3120 


aaaaccggtt 


ttcaggggaa 


tctgatcaca 


tctgctatgt 


tcctgacagc 


gatggcggcg 


3180 


aacccgctga 


ttgccaagct 


ggcccatgat 


gtcgcagggg 


tggacttaac 


atggacaagc 


3240 


tgggcaattg 


ccgcgattgt 


accgggactt 


gtaagcttaa 


tcatcacgcc 


gcttgtgatt 


3300 


tacaaactgt 


atccgccgga 


aatcaaagaa 


acaccggatg 


cggcgaaaat 


cgcaacagaa 


3360 


aaactgaaag 


aaatgggacc 


gttcaaaaaa 


tcggagcttt 


ccatggttat 


cgtgtttctt 


3420 


ttggtgcttg 


tgctgtggat 


ttttggcggc 


agcttcaaca 


tcgacgctac 


cacaaccgca 


3480 


ttgatcggtt 


tggccgttct 


cttattatca 


caagttctga 


cttgggatga 


tatcaagaaa 


3540 


gaacagggcg 


cttgggatac 


gctcacttgg 


tttgcggcgc 


ttgtcatgct 


cgccaacttc 


3600 


ttgaatgaat 


taggcatggt 


gtcttggttc 


agtaatgcca 


tgaaatcatc 


cgtatcaggg 


3660 


ttctcttgga 


ttgtggcatt 


catcatttta 


attgttgtgt 


attattactc 


tcactatttc 


3720 


tttgcaagtg 


cgacagccca 


catcagtgcg 


atgtattcag 


catttttggc 


tgtcgtcgtg 


3780 


gcagcgggcg 


caccgccgct 


tttagcagcg 


ctgagcctcg 


cgttcatcag 


caacctgttc 


3840 


gggtcaacga 


ctcactacgg 


ttctggagcg 


gctccggtct 


tcttcggagc 


aggctacatc 


3900 


ccgcaaggca 


aatggtggtc 


catcggattt 


atcctgtcga 


ttgttcatat 


catcgtatgg 


3960 


cttgtgatcg 


gcggattatg 


gtggaaagta 


ctaggaatat 


ggtagaaaga 


aaaaggcaga 


4020 


cgcggtctgc 


ctttttttat 


tttcactcct 


tcgtaagaaa 


atggattttg 


aaaaatgaga 


4080 


aaattccctg 


tgaaaaatgg 


tatgatctag 


gtagaaagga 


cggctggtgc 


tgtggtgaaa 


4140 


aagcggttcc 


atttttccct 


gcaaacaaaa 


ataatggggc 


tgattgcggc 


tctgctggtc 


4200 


tttgtcattg 


gtgtgctgac 


cattacgtta 


gccgttcagc 


atacacaggg 


agaacggaga 


4260 


caggcagagc 


agctggcggt 


tcaaacggcg 


agaaccattt 


cctatatgcc 


gccggttaaa 


4320 


gagctcattg 


agagaaaaga 


cggacatgcg 


gctcagacgc 


aagaggtcat 


tgaacaaatg 


4380 


aaagaacaga 


ctggtgcgtt 


tgccatttat 


gttttgaacg 


aaaaaggaga 


cattcgcagc 


4440 


gcctctggaa 


aaagcggatt 


aaagaaactg 


gagcgcagca 


gagaaatttt 


gtttggcggt 


4500 
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tcgcatgttt 


ctgaaacaaa 


agcggatgga 


cgaagagtga 


tcagagggag 


cgcgccgatt 


4560 


ataaaagaac 


agaagggata 


cagccaagtg 


atcggcagcg 


tgtctgttga 


ttttctgcaa 


4620 


acggagacag 


agcaaagcat 


caaaaagcat 


ttgagaaatt 


tgagtgtgat 


tgctgtgctt 


4680 


gtactgctgc 


tcggatttat 


tggcgccgcc 


gtgctggcga 


aaagcatcag 


aaaggatacg 


4740 


ctcgggcttg 


aaccgcatga 


gatcgcggct 


ctatatcgtg 


agaggaacgc 


aatgcttttc 


4800 


gcgattcgag 


aagggattat 


tgccaccaat 


cgtgaaggcg 


tcgtcaccat 


gatgaacgta 


4860 


tcggcggccg 


agatgctgaa 


gctgcccgag 


cctgtgatcc 


atcttcctat 


agatgacgtc 


4920 


atgccgggag 


cagggctgat 


gtctgtgctt 


gaaaaaggag 


aaatgctgcc 


gaaccaggaa 


4980 


gtaagcgtca 


acgatcaagt 


gtttattatc 


aatacgaaag 


tgatgaatca 


aggcggqcag 


5040 


gcgtatggga 


ttgtcgtcag 


cttcagggag 


aaaacagagc 


tgaagaagct 


gatcgacaca 


5100 


ttgacagagg 


ttcgcaaata 


ttcagaggat 


ctcagggcgc 


agactcatga 


attttcaaat 


5160 


aagctttatg 


cgattttagg 


gctgcgtcga 


cctgcaggca 


tgcaagcttg 


gcgtaatcat 


5220 


ggtcatagct 


gtttcctgtg 


tgaaattgtt 


atccgctcac 


aattccacac 


aacatacgag 


5280 


ccggaagcat 


aaagtgtaaa 


gcctggggtg 


cctaatgagt 


gagctaactc 


acattaattg 


5340 


cgttgcgctc 


actgcccgct 


ttccagtcgg 


gaaacctgtc 


gtgccagctg 


cattaatgaa 


5400 


tcggccaacg 


cgcggggaga 


ggcggtttgc 


gtattgggcg 


ctcttccgct 


tcctcgctca 


5460 


ctgactcgct 


gcgctcggtc 


gttcggctgc 


ggcgagcggt 


atcagctcac 


tcaaaggcgg 


5520 


taatacggtt 


atccacagaa 


tcaggggata 


acgcaggaaa 


gaacatgtga 


gcaaaaggcc 


5580 


agcaaaaggc 


caggaaccgt 


aaaaaggccg 


cgttgctggc 


gtttttccat 


aggctccgcc 


5640 


cccctgacga 


gcatcacaaa 


aatcgacgct 


caagtcagag 


gtggcgaaac 


ccgacaggac 


5700 


tataaagata 


ccaggcgttt 


ccccctggaa 


gctccctcgt 


gcgctctcct 


gttccgaccc 


5760 


tgccgcttac 


cggatacctg 


tccgcctttc 


tcccttcggg 


aagcgtggcg 


ctttctcata 


5820 


gctcacgctg 


taggtatctc 


agttcggtgt 


aggtcgttcg 


ctccaagctg 


ggctgtgtgc 

J Zt ZJ ZJ ZJ 


5880 


acgaaccccc 


cgttcagccc 


gaccgctgcg 


ccttatccgg 


taactatcgt 


cttgagtcca 


5940 


acccggtaag 


acacgactta 


tcgccactgg 


cagcagccac 


tggtaacagg 


attagcagag 


6000 


cgaggtatgt 


aggcggtgct 


acagagttct 


tgaagtggtg 


gcctaactac 


ggctacacta 


6060 


gaaggacagt 


atttggtatc 


tgcgctctgc 


tgaagccagt 


taccttcgga 


aaaagagttg 


6120 


gtagctcttg 


atccggcaaa 


caaaccaccg 


ctggtagcgg 


tggttttttt 


gtttgcaagc 


6180 


agcagattac 


gcgcagaaaa 


aaaggatctc 


aagaagatcc 


tttgatcttt 


tctacggggt 


6240 



ctgacgctca 


gtggaacgaa 


aactcacgtt 


aagggatttt 


ggtcatgaga 


ttatcaaaaa 


6300 


ggatcttcac 


ctagatcctt 


ttaaattaaa 


aatgaagttt 


taaatcaatc 


taaagtatat 


6360 


atgagtaaac 


ttggtctgac 


agttaccaat 


gcttaatcag 


tgaggcacct 


atctcagcga 


6420 


tctgtctatt 


tcgttcatcc 


atagttgcct 


gactccccgt 


cgtgtagata 


actacgatac 


6480 


gggagggctt 


accatctggc 


cccagtgctg 


caatgatacc 


gcgagaccca 


cgctcaccgg 


6540 


caccggattt 


atcagcaata 


aaccagccag 


ccggaagggc 


cgagcgcaga 


agtggtcctg 


6600 


caactttatc 


cgcctccatc 


cagtctatta 


attgttgccg 


ggaagctaga 


gtaagtagtt 


6660 


cgccagttaa 


tagtttgcgc 


aacgttgttg 


ccattgctac 


aggcatcgtg 


gtgtcacgct 


6720 


cgtcgtttgg 


tatggcttca 


ttcagctccg 


gttcccaacg 


atcaaggcga 


gttacatgat 


6780 


cccccatgtt 


gtgcaaaaaa 


gcggttagct 


ccttcggtcc 


tccgatcgtt 


gtcagaagta 


6840 


agttggccgc 


agtgttatca 


ctcatggtta 


tggcagcact 


gcataattct 


cttactgtca 


6900 


tgccatccgt 


aagatgcttt 


tctgtgactg 


gtgagtactc 


aaccaagtca 


ttctgagaat 


6960 


agtgtatgcg 


gcgaccgagt 


tgctcttgcc 


cggcgtcaat 


acgggataat 


accgcgccac 


7020 


atagcagaac 


tttaaaagtg 


ctcatcattg 


gaaaacgttc 


ttcggggcga 


aaactctcaa 


7080 


ggatcttacc 


gctgttgaga 


tccagttcga 


tgtaacccac 


tcgtgcaccc 


aactgatctt 


7140 


cagcatcttt 


tactttcacc 


agcgtttctg 


ggtgagcaaa 


aacaggaagg 


caaaatgccg 


7200 


caaaaaaggg 


aataagggcg 


acacggaaat 


gttgaatact 


catactcttc 


ctttttcaat 


7260 


attattgaag 


catttatcag 


ggttattgtc 


tcatgagcgg 


atacatattt 


gaatgtattt 


7320 


agaaaaataa 


acaaataggg 


gttccgcgca 


catttccccg 


aaaagtgcca 


cctgacgtct 


7380 


aagaaaccat 


tattatcatg 


acattaacct 


ataaaaatag 


gcgtatcacg 


aggccctttc 


7440 



gtc 7443 

<210> 22 
<211> 5718 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Sequence of MB1510 genomic integration region 
<400> 22 

gagcgccgtt tggctgaatg atacaacagt ctcacttcct tactgcgtct ggttgcaaaa 60 
acgaagaagc aaggattccc ctcgcttctc atttgtccta tttattatac acttttttaa 120 
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gcacatcttt 


ggcgcttgtt 


tcactagact 


tgatgcctct 


gaatcttgtc 


caagtgtcac 


180 


ggtccgcatc 


atagacttgt 


ccatttttca 


ccgctttgag 


atttttccag 


agcgggttcg 


240 


ttttccactc 


atctacaatg 


gttttgcctt 


cgttggctga 


gatgaacaaa 


atatcaggat 


300 


cgattttgct 


caattgctca 


aggctgacct 


cttgataggc 


gttatctgac 


ttcacagcgt 


360 


gtgtaaagcc 


tagcatttta 


aagatttctc 


cgtcatagga 


tgatgatgta 


tgaagctgga 


420 


aggaatccgc 


tcttgcaacg 


ccgagaacga 


tgttgcggtt 


ttcatctttc 


ggaagttcgg 


480 


cttttagatc 


gttgatgact 


tttttgtgct 


cggcaagctt 


ttcttttcct 


tcatcttctt 


540 


tatttaatgc 


tttagcaatg 


gtcgtaaagc 


tgtcgatcgt 


ttcgtcatat 


gtcgcttcac 


600 


ggctttttaa 


ttcaatcgtc 


ggggcgattt 


ttttcagctg 


tttataaatg 


tttttatggc 


660 


gctcagcgtc 


agcgatgatt 


aaatcaggct 


tcaaggaact 


gatgacctca 


agattgggtt 


720 


cgctgcgtgt 


gcctacagat 


gtgtaatcaa 


tggagctgcc 


gacaagcttt 


ttaatcatat 


780 


cttttttgtt 


gtcatctgcg 


atgcccaccg 


gcgtaatgcc 


gagattgtga 


acggcatcca 


840 


agaatgaaag 


ctcaagcaca 


accacccgct 


taggtgtgcc 


gcttactgtc 


gtttttcctt 


900 


cttcgtcatg 


gatcactctg 


gaatccttag 


actcgctttt 


gccgcttccg 


ttgttattct 


960 


ggcttgatga 


acagccggat 


acaatgaggc 


aggcgagcaa 


taaaacactc 


atgatggcaa 


1020 


tcaacttgtt 


agaataggtg 


cgcatgtcat 


tcttcctttt 


ttcagattta 


gtaatgagaa 


1080 


tcattatcac 


atgtaacact 


ataatagcat 


ggcttatcat 


gtcaatattt 


ttttagtaaa 


1140 


gaaagctgcg 


tttttactgc 


tttctcatga 


aagcatcatc 


agacacaaat 


aagtggtatg 


1200 


cagcgttacc 


gtgtcttcga 


gacaaaaacg 


catgggcgtt 


ggctttagag 


gtttcgaaca 


1260 


tatcagcagt 


gacataagga 


aggagagtgc 


tgagataacc 


ggacaatttc 


ttttctattt 


1320 


catctgttag 


tgcaaattca 


atgtcgccga 


tattcatgat 


aatcgagaaa 


acaaagtcga 


1380 


tatcgatatg 


aaaatgttcc 


tcggcaaaaa 


ccgcaagctc 


gtgaattcct 


ggtgaacatc 


1440 


cggcacgctt 


atggaaaatc 


tgtttgacta 


aatcactcac 


aatccaagca 


ttgtattgct 


1500 


gttctggtga 


aaagtattgc 


attagacata 


cctcctgctc 


gtacggataa 


aggcagcgtt 


1560 


tcatggtcgt 


gtgctccgtg 


cagcggcttc 


tccttaattt 


tgatttttct 


gaaaataggt 


1620 


cccgttccta 


tcactttacc 


atggacggaa 


aacaaatagc 


tactaccatt 


cctcctgttt 


1680 


ttctcttcaa 


tgttctggaa 


tctgtttcag 


gtacagacga 


tcgggtatga 


aagaaatata 


1740 


gaaaacatga 


aggaggaata 


tcgacatgaa 


accagttgta 


aaagagtata 


caaatgacga 


1800 


acagctcatg 


aaagatgtag 


aggaattgca 


gaaaatgggt 


gttgcgaaag 


aggatgtata 


1860 



cgtcttagct 


cacgacgatg 


acagaacgga 


acgcctggct 


gacaacacga 


acgccaacac 


1920 


gatcggagcc 


aaagaaacag 


gtttcaagca 


cgcggtggga 


aatatcttca 


ataaaaaagg 


1980 


agacgagctc 


cgcaataaaa 


ttcacgaaat 


cggtttttct 


gaagatgaag 


ccgctcaatt 


2040 


tgaaaaacgc 


ttagatgaag 


gaaaagtgct 


tctctttgtg 


acagataacg 


aaaaagtgaa 


2100 


agcttgggca 


taaagcaagg 


aaaaaaccaa 


aaggccaatg 


tcggcctttt 


ggtttttttg 


2160 


cggtctttgc 


ggtgggattt 


tgcagaatgc 


cgcaatagga 


tagcggaaca 


ttttcggttc 


2220 


tgaatgtccc 


tcaatttgct 


attatatttt 


tgtgataaat 


tggaataaaa 


tctcacaaaa 


2280 


tagaaaatgg 


gggtacatag 


tggatgaaaa 


aagtgatgtt 


agctacggct 


ttgtttttag 


2340 


gattgactcc 


agctggcgcg 


aacgcagctg 


atttaggcca 


ccagacgttg 


ggatccaatg 


2400 


atggctgggg 


cgcgtactcg 


accggcacga 


caggcggatc 


aaaagcatcc 


tcctcaaatg 


2460 


tgtataccgt 


cagcaacaga 


aaccagcttg 


tctcggcatt 


agggaaggaa 


acgaacacaa 


2520 


cgccaaaaat 


catttatatc 


aagggaacga 


ttgacatgaa 


cgtggatgac 


aatctgaagc 


2580 


cgcttggcct 


aaatgactat 


aaagatccgg 


agtatgattt 


ggacaaatat 


ttgaaagcct 


2640 


atgatcctag 


cacatggggc 


aaaaaagagc 


cgtcgggaac 


acaagaagaa 


gcgagagcac 


2700 


gctctcagaa 


aaaccaaaaa 


gcacgggtca 


tggtggatat 


ccctgcaaac 


acgacgatcg 


2760 


tcggttcagg 


gactaacgct 


aaagtcgtgg 


gaggaaactt 


ccaaatcaag 


agtgataacg 


2820 


tcattattcg 


caacattgaa 


ttccaggatg 


cctatgacta 


ttttccgcaa 


tggttgtaaa 


2880 


acgacggcca 


gtgaattctg 


atcaaatggt 


tcagtgagag 


cgaagcgaac 


acttgatttt 


2940 


ttaattttct 


atcttttata 


ggtcattaga 


gtatacttat 


ttgtcctata 


aactatttag 


3000 


cagcataata 


gatttattga 


ataggtcatt 


taagttgagc 


atattagagg 


aggaaaatct 


3060 


tggagaaata 


tttgaagaac 


ccgagatcta 


gatcaggtac 


cgcaacgttc 


gcagatgctg 


3120 


ctgaagagat 


tattaaaaag 


ctgaaagcaa 


aaggctatca 


attggtaact 


gtatctcagc 


3180 


ttgaagaagt 


gaagaagcag 


agaggctatt 


gaataaatga 


gtagaaagcg 


ccatatcggc 


3240 


gcttttcttt 


tggaagaaaa 


tatagggaaa 


atggtacttg 


ttaaaaattc 


ggaatattta 


3300 


tacaatatca 


tatgtatcac 


attgaaagga 


ggggcctgct 


gtccagactg 


tccgctgtgt 


3360 


aaaaataagg 


aataaagggg 


ggttgacatt 


attttactga 


tatgtataat 


ataatttgta 


3420 


taagaaaatg 


gaggggccct 


cgaaacgtaa 


gatgaaacct 


tagataaaag 


tgcttttttt 


3480 


gttgcaattg 


aagaattatt 


aatgttaagc 


ttaattaaag 


ataatatctt 


tgaattgtaa 


3540 



cgcccctcaa 


aagtaagaac 


tacaaaaaaa 


gaatacgtta 


tatagaaata 


tgtttgaacc 


3600 


ttcttcagat 


tacaaatata 


ttcggacgga 


ctctacctca 


aatgcttatc 


taactataga 


3660 


atgacataca 


agcacaacct 


tgaaaatttg 


aaaatataac 


taccaatgaa 


cttgttcatg 


3720 


tgaattatcg ctgtatttaa 


ttttctcaat 


tcaatatata 


atatgccaat 


acattgttac 


3780 


aagtagaaat 


taagacaccc 


ttgatagcct 


tactatacct 


aacatgatgt 


agtattaaat 


3840 


gaatatgtaa 


atatatttat 


gataagaagc 


gacttattta 


taatcattac 


atatttttct 


3900 


attggaatga 


ttaagattcc 


aatagaatag 


tgtataaatt 


atttatcttg 


aaaggaggqa 


3960 


tgcctaaaaa 


cgaagaacat 


taaaaacata 


tatttgcacc 


gtctaatgga 


tttatgaaaa 


4020 


atcattttat 


cagtttgaaa 


attatgtatt 


atggagctct 


gaaaaaaagg 


agaggataaa 


4080 


gagaaaaggg 


gatcggaaaa 


caagtatata 


ggaggagacc 


tatttatggc 


ttcagaaaaa 


4140 


gacgcaggaa 


aacagtcagc 


agtaaagctt 


gttccattgc 


ttattactgt 


cgctgtggga 


4200 


ctaatcatct 


ggtttattcc 


cgctccgtcc 


ggacttgaac 


ctaaagcttg 


gcatttgttt 


4260 


gcgatttttg 


tcgcaacaat 


tatcggcttt 


atctccaagc 


ccttgccaat 


gggtgcaatt 


4320 


gcaatttttg 


cattggcggt 


tactgcacta 


actggaacac 


tatcaattga 


ggatacatta 


4380 


agcggattcg 


ggaataagac 


catttggctt 


atcgttatcg 


cattctttat 


ttcccgggga 


4440 


tttatcaaaa 


ccggtctcgg 


tgcgagaatt 


tcgtatgtat 


tcgttcagaa 


attcggaaaa 


4500 


aaaacccttg 


gactttctta 


ttcactgcta 


ttcagtgatt 


taatactttc 


acctgctatt 


4560 


ccaagtaata 


cggcgcgtgc 


aggaggcatt 


atatttccta 


ttatcagatc 


attatccgaa 


4620 


acattcggat 


caagcccggc 


aaatggaaca 


gagagaaaaa 


tcggtgcatt 


cttattaaaa 


4680 


accggttttc 


aggggaatct 


gatcacatct 


gctatgttcc 


tgacagcgat 


ggcggcgaac 


4740 


ccgctgattg 


ccaagctggc 


ccatgatgtc 


gcaggggtgg 


acttaacatg 


gacaagctgg 


4800 


gcaattgccg 


cgattgtacc 


gggacttgta 


agcttaatca 


tcacgccgct 


tgtgatttac 


4860 


aaactgtatc 


cgccggaaat 


caaagaaaca 


ccggatgcgg 


cgaaaatcgc 


aacagaaaaa 


4920 


ctgaaagaaa 


tgggaccgtt 


caaaaaatcg 


gagctttcca 


tggttatcgt 


gtttcttttg 


4980 


gtgcttgtgc 


tgtggatttt 


tggcggcagc 


ttcaacatcg 


acgctaccac 


aaccgcattg 


5040 


atcggtttgg 


ccgttctctt 


attatcacaa 


gttctgactt 


gggatgatat 


caagaaagaa 


5100 


cagggcgctt 


gggatacgct 


cacttggttt 


gcggcgcttg 


tcatgctcgc 


caacttcttg 


5160 


aatgaattag 


gcatggtgtc 


ttggttcagt 


aatgccatga 


aatcatccgt 


atcagggttc 


5220 


tcttggattg 


tggcattcat 


cattttaatt 


gttgtgtatt 


attactctca 


ctatttcttt 


5280 



gcaagtgcga cagcccacat cagtgcgatg tattcagcat ttttggctgt cgtcgtggca 
gcgggcgcac cgccgctttt agcagcgctg agcctcgcgt tcatcagcaa cctgttcggg 
tcaacgactc actacggttc tggagcggct ccggtcttct tcggagcagg ctacatcccg 
caaggcaaat ggtggtccat cggatttatc ctgtcgattg ttcatatcat cgtatggctt 
gtgatcggcg gattatggtg gaaagtacta ggaatatggt agaaagaaaa aggcagacgc 
ggtctgcctt tttttatttt cactccttcg taagaaaatg gattttgaaa aatgagaaaa 
ttccctgtga aaaatggtat gatctaggta gaaaggacgg ctggtgctgt ggtgaaaaag 
cggttccatt tttccctg 



<210> 23 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1605 

<400> 23 

gacggccagt gaattcgata aaagtgc 

<210> 24 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1606 



<220> 

<221> misc_f eature 

<222> (13) . . (13) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc__f eature 

<222> (16).. (16) 

<223> n is a, c, g, or t 

<400> 24 

ccagatctct atnktnktgt acggagtcta actccccaag ag 

<210> 25 

<211> 1112 

<212> DNA 

<213> Nocardiopsis dassonvillei DSM 43235 
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<400> 25 
gcttttagtt 


catcgatcgc 


atcggctgct 


ccggcccccg 


tcccccagac 


ccccgtcgcc 


60 


gacgacagcg 


ccgccagcat 


gaccgaggcg 


ctcaagcgcg acctcgacct 


cacctcggcc 


120 


gaggccgagg 


agcttctctc 


ggcgcaggaa 


gccgccatcg 


agaccgacgc 


cgaggccacc 


180 


gaggccgcgg 


gcgaggccta 


cggcggctca 


ctgttcgaca 


ccgagaccct 


cgaactcacc 


240 


gtgctggtca 


ccgacgcctc 


cgccgtcgag 


gcggtcgagg 


ccaccggagc 


ccaggccacc 


300 


gtcgtctccc 


acggcaccga 


gggcctgacc 


gaggtcgtgg 


aggacctcaa 


cggcgccgag 


360 


gttcccgaga 


gcgtcctcgg 


ctggtacccg 


gacgtggaga 


gcgacaccgt 


cgtggtcgag 


420 


gtgctggagg 


gctccgacgc 


cgacgtcgcc 


gccctgctcg 


ccgacgccgg 


tgtggactcc 


480 


tcctcggtcc 


gggtggagga 


ggccgaggag 


gccccgcagg 


tctacgccga 


catcatcggc 


540 


ggcctggcct 


actacatggg 


cggccgctgc 


tccgtcggct 


tcgccgcgac 


caacagcgcc 


600 


ggtcagcccg 


gtttcgtcac 


cgccggccac 


tgcggcaccg 


tcggcaccgg 


cgtgaccatc 


660 


ggcaacggca 


ccggcacctt 


ccagaactcg 


gtcttccccg 


gcaacgacgc 


cgccttcgtc 


720 


cgcggcacct 


ccaacttcac 


cctgaccaac 


ctggtctcgc 


gctacaactc 


cggcggctac 


780 


cagtcggtga 


ccggtaccag 


ccaggccccg 


gccggctcgg 


ccgtgtgccg 


ctccggctcc 


840 


accaccggct 


ggcactgcgg 


caccatccag 


gcccgcaacc 


agaccgtgcg 


ctacccgcag 


900 


ggcaccgtct 


actcgctcac 


ccgcaccaac 


gtgtgcgccg 


agcccggcga 


ctccggcggt 


960 


tcgttcatct 


ccggctcgca 


ggcccagggc 


gtcacctccg 


gcggctccgg 


caactgctcc 


1020 


gtcggcggca 


cgacctacta 


ccaggaggtc 


accccgatga 


tcaactcctg 


gggtgtcagg 


1080 


atccggacct 


aatcgcatgt 


tcaatccgct 


cc 
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<210> 26 

<211> 48 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1423 

<400> 26 

gcttttagtt catcgatcgc atcggctgct ccggcccccg tcccccag 48 

<210> 27 

<211> 45 

<212> DNA 

<213> Artificial sequence 

16 



<220> 

<223> Primer 1475 



<400> 27 

ggagcggatt gaacatgcga ttaggtccgg atcctgacac cccag 



<210> 28 

<211> 354 

<212> PRT 

<213> Nocardiopsis dassonvillei DSM 43235 



<220> 

<221> PROPEP 
<222> (1)..(166) 

<220> 

<221> mat_peptide 
<222> (167) . . (354) 

<400> 28 

Ala Pro Ala Pro Val Pro Gin Thr Pro Val Ala Asp Asp Ser Ala 
-165 -160 -155 



Ala Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser 
-150 -145 -140 



Ala Glu Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala lie Glu 
-135 -130 -125 



Thr Asp Ala Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly 
-120 -115 -110 



Ser Leu Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 



Ala Ser Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gin Ala Thr Val 
-90 -85 -80 -75 



Val Ser His Gly Thr Glu Gly Leu Thr Glu Val Val Glu Asp Leu Asn 
-70 -65 -60 



Gly Ala Glu Val Pro Glu Ser Val Leu Gly Trp Tyr Pro Asp Val Glu 
-55 -50 -45 



Ser Asp Thr Val Val Val Glu Val Leu Glu Gly Ser Asp Ala Asp Val 
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-40 



-35 



-30 



Ala Ala Leu Leu Ala Asp Ala Gly Val Asp Ser Ser Ser Val Arg Val 
-25 -20 -15 



Glu Glu Ala Glu Glu Ala Pro Gin Val Tyr Ala Asp He He Gly Gly 
-10 -5 -11 5 



Leu Ala Tyr Tyr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr 
10 15 20 



Asn Ser Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr 
25 30 35 



Val Gly Thr Gly Val Thr He Gly Asn Gly Thr Gly Thr Phe Gin Asn 
40 45 50 



Ser Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn 
55 60 65 70 



Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Gin 
75 80 " ' 85 



Ser Val Thr Gly Thr Ser Gin Ala Pro Ala Gly Ser Ala Val Cys Arg 
90 95 100 



Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Asn 
105 110 115 



Gin Thr Val Arg Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr 
120 125 130 



Asn Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe He Ser Gly 
135 140 145 150 



Ser Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser Val 
155 160 165 



Gly Gly Thr Thr Tyr Tyr Gin Glu Val Thr Pro Met He Asn Ser Trp 
170 175 180 



Gly Val Arg He Arg Thr 
185 
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<210> 29 

<211> 498 

<212> DNA 

<213> Nocardiopsis dassonvillei DSM 43235 

<400> 29 



gctccggccc 


ccgtccccca 


gacccccgtc 


gccgacgaca 


qcgccgccag 


catgaccgag 


60 


gcgctcaagc 


gcgacctcga 


cctcacctcg 


gccgaggccg 


aggagcttct 


ctcggcgcag 


120 


gaagccgcca 


tcgagaccga 


cgccgaggcc 


accgaggccg 


cgggcgaggc 


ctacggcggc 


180 


tcactgttcg 


acaccgagac 


cctcgaactc 


accgtgctgg 


tcaccgacgc 


ctccgccgtc 


240 


gaggcggtcg 


aggccaccgg 


agcccaggcc 


accgtcgtct 


cccacggcac 


cgagggcctg 


300 


accgaggtcg 


tggaggacct 


caacggcgcc 


gaggttcccg 


agagcgtcct 


cggctggtac 


360 


ccggacgtgg 


agagcgacac 


cgtcgtggtc 


gaggtgctgg 


agggctccga 


cgccgacgtc 


420 


gccgccctgc 


tcgccgacgc 


cggtgtggac 


tcctcctcgg 


tccgggtgga 


ggaggccgag 


480 


gaggccccgc 


aggtctac 










498 



<210> 30 
<211> 166 
<212> PRT 

<213> Nocardiopsis dassonvillei DSM 43235 
<400> 30 

Ala Pro Ala Pro Val Pro Gin Thr Pro Val Ala Asp Asp Ser Ala Ala 
15 10 ' 15 



Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser Ala Glu 
20 25 30 



Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala lie Glu Thr Asp Ala 
35 4 0 4 5 



Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly Ser Leu Phe Asp 
50 55 60 



Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ser Ala Val 
65 70 75 80 



Glu Ala Val Glu Ala Thr Gly Ala Gin Ala Thr Val Val Ser His Gly 
85 90 95 
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Thr Glu Gly Leu Thr Glu Val Val Glu Asp Leu Asn Gly Ala Glu Val 
100 105 110 



Pro Glu Ser Val Leu Gly Trp Tyr Pro Asp Val Glu Ser Asp Thr Val 
115 120 125 



Val Val Glu Val Leu Glu Gly Ser Asp Ala Asp Val Ala Ala Leu Leu 
130 135 140 



Ala Asp Ala Gly Val Asp Ser Ser Ser Val Arg Val Glu Glu Ala Glu 
145 150 155 160 



Glu Ala Pro Gin Val Tyr 
165 



<210> 31 
<211> 1146 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> The DNA sequence coding for the pro-region of SEQ ID NO: 29 fused 
in frame to A1918L2 protease tail-variant encoding gene; whole 
construct: 10R (proA1918L2 ) . 

<400> 31 



atgaagaaac 


cgttggggaa 


aattgtcgca 


agcaccgcac 


tactcatttc 


tgttgctttt 


60 


agttcatcga 


tcgcatcggc 


tgctccggcc 


cccgtccccc 


agacccccgt 


cgccgacgac 


120 


agcgccgcca 


gcatgaccga 


ggcgctcaag 


cgcgacctcg 


acctcacctc 


ggccgaggcc 


180 


gaggagcttc 


tctcggcgca 


ggaagccgcc 


atcgagaccg 


acgccgaggc 


caccgaggcc 


240 


gcgggcgagg 


cctacggcgg 


ctcactgttc 


gacaccgaga 


ccctcgaact 


caccgtgctg 


300 


gtcaccgacg 


cctccgccgt 


cgaggcggtc 


gaggccaccg 


gagcccaggc 


caccgtcgtc 


360 


tcccacggca 


ccgagggcct 


gaccgaggtc 


gtggaggacc 


tcaacggcgc 


cgaggttccc 


420 


gagagcgtcc 


tcggctggta 


cccggacgtg 


gagagcgaca 


ccgtcgtggt 


cgaggtgctg 


480 


gagggctccg 


acgccgacgt 


cgccgccctg 


ctcgccgacg 


ccggtgtgga 


ctcctcctcg 


540 


gtccgggtgg 


aggaggccga 


ggaggccccg 


caggtctatg 


ccgatatcat 


tggaggccta 


600 


gcgtacacaa 


tgggtggtcg 


ctgcagcgta 


ggatttgcag 


ccacaaatgc 


agctggacaa 


660 


cctggcttcg 


tgacagctgg 


acattgcggc 


cgcgtcggta 


cacaggttac 


tatcggcaat 


720 


ggaagaggtg 


tctttgagca 


aagcgtattt 


cccgggaatg 


atgctgcctt 


cgttagaggt 


780 
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acgtccaact ttacgcttac taacttagta tctagataca acactggcgg atatgcaact 840 

gtagcaggtc acaatcaagc acctattggc tctagcgtct gccgctcagg gtcgactaca 900 

ggatggcatt gtggaaccat tcaagctaga ggtcagagcg tgagctatcc tgaaggtacc 960 

gtaacgaaca tgactcgtac gactgtatgt gcagaaccag gtgactctgg aggttcatat 1020 

atcagcggta cgcaagcgca aggcgttacc tcaggtggat ccggtaactg taggacaggt 1080 

ggcacaacgt tctaccagga agtgacaccg atggtgaact cttggggagt tagactccgt 1140 

acataa 1146 

<210> 32 

<211> 1068 

<212> DNA 

<213> Nocardiopsis Alba DSM 15647 

<400> 32 



gcgaccggcc 


ccctccccca 


gtcccccacc 


ccqqatqaaa 


ccgaggccac 


caccatggt c 


60 


gaggccctcc 


agcgcgacct 


cggcctgtcc 


ccctctcagg 


ccgacgagct 


cct cgaggcg 


120 


caggccgagt 


ccttcgagat 


cqacqaggcc 


qccaccqcqq 


ccgcagccga 


ctcct acggc 


180 


ggctccatct 


tcgacaccga 


cagcctcacc 


ctgaccgtcc 


tggtcaccga 


cqcctccqcc 


240 


gtcgaggcgg 


tcgaggccgc 


cggcgccgag 


gccaaggtgg 


tctcgcacgg 


catggagggc 


300 


ctggaggaga 


tcgtcgccga 


cctgaacgcg 


gccgacgctc 


agcccggcgt 


cgtgggctgg 


360 


taccccgaca 


tccactccga 


cacggtcgtc 


ctcgaggtcc 


tcgagggctc 


cggtgccgac 


420 


gtggactccc 


tgctcgccga 


cgccggtgtg 


gacaccgccg 


acgtcaaggt 


ggagagcacc 


480 


accgagcagc 


ccgagctgta 


cgccgacatc 


atcggcggtc 


tcgcctacac 


catgggtggg. 


540 


cgctgctcgg 


tcggcttcgc 


ggccaccaac 


gcctccggcc 


agcccgggtt 


cgtcaccgcc 


600 


ggccactgcg 


gcaccgtcgg 


caccccggtc 


agcatcggca 


acggccaggg 


cgtcttcgag 


660 


cgttccgtct 


tccccggcaa 


cgactccgcc 


ttcgtccgcg 


gcacctcgaa 


cttcaccctg 


720 


accaacctgg 


tcagccgcta 


caacaccggt 


ggttacgcga 


ccgtctccgg 


ctcctcgcag 


780 


gcggcgatcg 


gctcgcagat 


ctgccgttcc 


ggctccacca 


ccggctggca 


ctgcggcacc 


840 


gtccaggccc 


gcggccagac 


ggtgagctac 


ccccagggca 


ccgtgcagaa 


cctgacccgc 


900 


accaacgtct 


gcgccgagcc 


cggtgactcc 


ggcggctcct 


tcatctccgg 


cagccaggcc 


960 


cagggcgtca 


cctccggtgg 


ctccggcaac 


tgctccttcg 


gtggcaccac 


ctactaccag 


1020 


gaggtcaacc 


cgatgctgag 


cagctggggt 


ctgaccctgc 


gcacctga 




1068 
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<210> 33 

<211> 355 

<212> PRT 

<213> Nocardiopsis Alba DSM 15647 



<220> 

<221> PROPEP 
<222> <1)..(167) 



<220> 

<221> mat_peptide 
<222> (168) . . (355) 

<400> 33 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Asp Glu Ala Glu 
-165 -160 -155 



Ala Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser 
-150 -145 -140 



Pro Ser Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu Ser Phe 
-135 -130 -125 



Glu lie Asp Glu Ala Ala Thr Ala Ala Ala Ala Asp Ser Tyr Gly 
-120 -115 -110 



Gly Ser lie Phe Asp Thr Asp Ser Leu Thr Leu Thr Val Leu Val Thr 
-105 -100 -95 



Asp Ala Ser Ala Val Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys 
-90 -85 -80 



Val Val Ser His Gly Met Glu Gly Leu Glu Glu He Val Ala Asp Leu 
-75 -70 -65 -60 



Asn Ala Ala Asp Ala Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He 
-55 -50 -45 



His Ser Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp 
-40 -35 -30 



Val Asp Ser Leu Leu Ala Asp Ala Gly Val Asp Thr Ala Asp Val Lys 
-25 -20 -15 
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Val Glu Ser Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp He He Gly 
-10 -5 -1 1 5 



Gly Leu Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala 
10 15 20 



Thr Asn Ala Ser Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly 
25 30 35 



Thr Val Gly Thr Pro Val Ser He Gly Asn Gly Gin Gly Val Phe Glu 
40 45 50 



Arg Ser Val Phe Pro Gly Asn Asp Ser Ala Phe Val Arg Gly Thr Ser 
55 60 65 



Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr 
70 75 80 85 



Ala Thr Val Ser Gly Ser Ser Gin Ala Ala He Gly Ser Gin He Cys 
90 95 100 



Arg Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr Val Gin Ala Arg 
105 110 115 



Gly Gin Thr Val Ser Tyr Pro Gin Gly Thr Val Gin Asn Leu Thr Arg 
120 125 130 



Thr Asn Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe He Ser 
135 140 ^ 145 



Gly Ser Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser 
150 155 160 165 



Phe Gly Gly Thr Thr Tyr Tyr Gin Glu Val Asn Pro Met Leu Ser Ser 
170 175 180 



Trp Gly Leu Thr Leu Arg Thr 
185 



<210> 34 

<211> 43 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Primer 1421 



<400> 34 

gttcatcgat cgcatcggct gcgaccggcc ccctccccca gtc 43 



<210> 35 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1604 

<400> 35 

gcggatccta tcaggtgcgc agggtcagac c 
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<210> 36 
<211> 1062 
<212> DNA 

<213> Nocardiopsis prasina DSM 15648 
<400> 36 

gccaccggac cgctccccca gtcacccacc ccggaggccg acgccgtctc catgcaggag 60 

gcgctccagc gcgacctcgg cctgaccccg cttgaggccg atgaactgct ggccgcccag 120 

gacaccgcct tcgaggtcga cgaggccgcg gccgcggccg ccggggacgc ctacggcggc 180 

tccgtcttcg acaccgagac cctggaactg accgtcctgg tcaccgacgc cgcctcggtc 240 

gaggctgtgg aggccaccgg cgcgggtacc gaactcgtct cctacggcat cgagggcctc 300 

gacgagatca tccaggatct caacgccgcc gacgccgtcc ccggcgtggt cggctggtac 360 

ccggacgtgg cgggtgacac cgtcgtcctg gaggtcctgg agggttccgg agccgacgtg 420 

agcggcctgc tcgccgacgc cggcgtggac gcctcggccg tcgaggtgac cagcagtgcg 480 

cagcccgagc tctacgccga catcatcggc ggtctggcct acaccatggg cggccgctgt 540 

tcggtcggat tcgcggccac caacgccgcc ggtcagcccg gattcgtcac cgccggtcac 600 

tgtggccgcg tgggcaccca ggtgagcatc ggcaacggcc agggcgtctt cgagcagtcc 660 

atcttcccgg gcaacgacgc cgccttcgtc cgcggcacgt ccaacttcac gctgaccaac 720 

ctggtcagcc gctacaacac cggcggttac gccaccgtcg ccggccacaa ccaggcgccc 780 

atcggctcct ccgtctgccg ctccggctcc accaccggct ggcactgcgg caccatccag 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatgac ccggaccacc 900 

gtgtgcgccg agcccggcga ctccggcggc tcctacatct ccggcaacca ggcccagggc 960 
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gtcacctccg gcggctccgg caactgccgc accggcggga ccaccttcta ccaggaggtc 1020 
acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 1062 



<210> 37 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina DSM 15648 



<220> 

<221> PROPEP 

<222> (1)..(165) 

<220> 

<221> mat_peptide 

<222> (166) . . (353) 

<400> 37 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 



Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 -140 



Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 



Val Asp Glu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 



Ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 -90 



Ala Ala Ser Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 
-85 -80 ~ -75 



Val Ser Tyr Gly He Glu Gly Leu Asp Glu He He Gin Asp Leu Asn 
-70 -65 -60 



Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 



Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val 
-40 -35 -30 



25 



Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val 
-25 -20 -15 -10 



Thr Ser Ser Ala Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 
-5 -11 5 



Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn 
10 15 20 



Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val 
25 30 35 



Gly Thr Gin Val Ser He Gly Asn Gly Gin Gly Val Phe Glu Gin Ser 
40 45 50 55 



He Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 
60 65 70 



Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
75 80 1 85 



Val Ala Gly His Asn Gin Ala Pro He Gly Ser Ser Val Cys Arg Ser 
90 95 100 



Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin 
105 110 115 



Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 



Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr He Ser Gly Asn 
140 145 150 



Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 
155 160 165 



Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly 
170 175 180 



Val Arg Leu Arg Thr 
185 
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<210> 38 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1346 

<400> 38 

gttcatcgat cgcatcggct gccaccggac cgctccccca gtc 43 

<210> 39 

<211> 38 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1602 

<400> 39 

gcggatccta ttaggtccgg agacggacgc cccaggag 38 

<210> 40 

<211> 1062 

<212> DNA 

<213> Nocardiopsis prasina DSM 15649 

<400> 40 



gccaccggac 


cactccccca 


gtcacccacc 


ccggaggccg 


acgccgtctc 


catgcaggag 


60 


gcgctccagc 


gcgacctcgg 


cctgaccccg 


cttgaggccg 


atgaactgct 


ggccgcccag 


120 


gacaccgcct 


tcgaggtcga 


cgaggccgcg 


gccgaggccg 


ccggtgacgc 


ctacggcggc 


180 


tccgtcttcg 


acaccgagac 


cctggaactg 


accgtcctgg 


tcaccgactc 


cgccgcggtc 


240 


gaggcggtgg 


aggccaccgg 


cgccgggacc 


gaactggtct 


cctacggcat 


cacgggcctc 


300 


gacgagatcg 


tcgaggagct 


caacgccgcc 


gacgccgttc 


ccggcgtggt 


cggctggtac 


360 


ccggacgtcg 


cgggtgacac 


cgtcgtgctg 


gaggtcctgg 


agggttccgg 


cgccgacgtg 


420 


ggcggcctgc 


tcgccgacgc 


cggcgtggac 


gcctcggcgg 


tcgaggtgac 


caccaccgag 


480 


cagcccgagc 


tgtacgccga 


catcatcggc 


ggtctggcct 


acaccatggg 


cggccgctgt 


540 


tcggtcggct 


tcgcggccac 


caacgccgcc 


ggtcagcccg 


ggttcgtcac 


cgccggtcac 


600 


tgtggccgcg 


tgggcaccca 


ggtgaccatc 


ggcaacggcc 


ggggcgtctt 


cgagcagtcc 


660 


atcttcccgg 


gcaacgacgc 


cgccttcgtc 


cgcggaacgt 


ccaacttcac 


gctgaccaac 


720 


ctggtcagcc 


gctacaacac 


cggcggctac 


gccaccgtcg 


ccggtcacaa 


ccaggcgccc 


780 


atcggctcct 


ccgtctgccg 


ctccggctcc 


accaccggtt 


ggcactgcgg 


caccatccag 


840 
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gcccgcggcc 


agtcggtgag 


ctaccccgag 


ggcaccgtca 


ccaacatgac 


gcggaccacc 


900 


gtgtgcgccg 


agcccggcga 


ctccggcggc 


tcctacatct 


ccggcaacca 


ggcccagggc 


960 


gtcacctccg 


gcggctccgg 


caactgccgc 


accggcggga 


ccaccttcta 


ccaggaggtc 


1020 


acccccatgg 


tgaactcctg 


gggcgtccgt 


ctccggacct 


aa 




1062 



<210> 41 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina DSM 15649 



<220> 

<221> PROPEP 

<222> (1)..(165) 

<220> 

<221> mat_peptide 

<222> (166) . . (353) 

<400> 41 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 



Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 ^ ^ -140 



Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 



Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 



Ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 -90 



Ser Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 
-85 -80 -75 



Val Ser Tyr Gly He Thr Gly Leu Asp Glu He Val Glu Glu Leu Asn 
-70 -65 -60 



Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 
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Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val 
-40 -35 -30 



Gly Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val 
-25 -20 -15 -10 



Thr Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp He He Gly Gly Leu 
-5 -11 5 



Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn 
10 15 20 



Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val 
25 30 35 



Gly Thr Gin Val Thr He Gly Asn Gly Arg Gly Val Phe Glu Gin Ser 
40 45 50 55 



He Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 
60 65 70 



Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
75 80 ' 85 



Val Ala Gly His Asn Gin Ala Pro He Gly Ser Ser Val Cys Arg Ser 
90 95 100 



Gly Ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Gly Gin 
105 110 115 



Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 



Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr He Ser Gly Asn 
140 145 J 150 



Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 
155 160 165 



Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly 
170 175 180 
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Val Arg Leu Arg Thr 
185 



<210> 42 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1603 

<400> 42 

gttcatcgat cgcatcggct gccaccggac cactccccca gtc 

<210> 43 

<211> 353 

<212> PRT 

<213> Nocardiopsis sp. NRRL 18262 



<220> 

<221> PROPEP 

<222> (1)..(165) 

<220> 

<221> mat_peptide 

<222> (166) . . (1059) 

<400> 43 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160- -155 



Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser 
-150 -145 ~ ^ -140 



Ala Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 



Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 



Ser Val Phe Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 -90 



Ala Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 
-85 -80 "* -75 



Val Ser Tyr Gly He Asp Gly Leu Asp Glu He Val Gin Glu Leu Asn 
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-70 



-65 



-60 



Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 



Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val 
-40 -35 -30 



Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val 
-25 -20 -15 -10 



Thr Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp He He Gly Gly Leu 
-5 -11 5 



Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn 
10 15 20 



Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val 

25 30 35 " " • 



Gly Thr Gin Val Thr He Gly Asn Gly Arg Gly Val Phe Glu Gin Ser 
40 45 50 55 



Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 
60 65 ~ 70 



Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
75 80 ^ ~ 85 



Val Ala Gly His Asn Gin Ala Pro He Gly Ser Ser Val Cys Arg Ser 
90 95 100 



Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin 
105 110 115 



Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 



Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie Ser Gly Thr 
140 145 150 



Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 
155 160 165 
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Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly 
170 175 180 



Val Arg Leu Arg Thr 
185 



<210> 44 

<211> 1164 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic protease encoding gene 



<220> 

<221> CDS 

<222> (1)..(1164) 

<223> Full length protease 

<220> 

<221> sig_peptide 
<222> (1)..(81) 

<220> 

<221> misc_feature 

<222> (82) . . (1164) 

<223> Propeptide 

<220> 

<221> mat_jpeptide 
<222> (577) (1164) 

<400> 44 

atg aaa aaa ccg ctg gga aaa att gtc gca age aca gca ctt ctt 45 

Met Lys Lys Pro Leu Gly Lys lie Val Ala Ser Thr Ala Leu Leu 
-190 J ' -185 -180 

att tea gtg gca ttt age tea tct att gca tea gca get aca gga 90 
lie Ser Val Ala Phe Ser Ser Ser lie Ala Ser Ala Ala Thr Gly 
-175 -170 -165 

gca tta ccg cag tct ccg aca ccg gaa gca gat gca gtc tea atg 135 
Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val Ser Met 
-160 -155 -150 

caa gaa gca ctg caa aga gat ctt gat ctt aca tea gca gaa gca 180 
Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu Ala 
-145 -140 -135 

gaa gaa ctt ctt get gca caa gat aca gca ttt gaa gtg gat gaa 225 
Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
-130 -125 -120 



32 



gca gcg gca gaa gca gca gga gat gca tat ggc ggc tea gtt ttt 270 
Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe 
-115 -110 -105 

gat aca gaa tea ctt gaa ctt aca gtt ctt gtt aca gat gca gca gca 318 
Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ala Ala 
-100 -95 -90 

gtt gaa gca gtt gaa gca aca gga gca gga aca gta ctt gtt tea tat 366 
Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Val Leu Val Ser Tyr 
-85 -80 ' -75 

gga att gat ggc ctt gat gaa att gtt caa gaa ctg aat gca get gat 414 
Gly He Asp Gly Leu Asp Glu He Val Gin Glu Leu Asn Ala Ala Asp 
-70 -65 -60 -55 

get gtt ccg ggc gtt gtt ggc tgg tat ccg gat gtt get gga gat aca 4 62 

Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala Gly Asp Thr 
-50 -45 -40 

gtt gtc ctt gaa gtt ctt gaa gga tea ggc gca gat gtt tea ggc ctg 510 
Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Ser Gly Leu 
-35 -30 -25 

ctg gca gac gca gga gtc gat gca tea gca gtt gaa gtt aca aca tea 558 
Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr Thr Ser 
-20 -15 -10 

gat caa ccg gaa ctt tat gca gat att att ggc ggc ctg gca tat tat 606 
Asp Gin Pro Glu Leu Tyr Ala Asp He He Gly Gly Leu Ala Tyr Tyr 
-5 -11 5 ^ 10 

atg ggc ggc aga tgc age gtt ggc ttt gca gca aca aat gca tea ggc 654 
Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn Ala Ser Gly 
15 20 25 

caa ccg ggc ttt gtt aca gca ggc cat tgc ggc aca gtt ggc aca cca 702 
Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr Val Gly Thr Pro 
30 35 40 

gtt tea att ggc aat ggc aaa ggc gtt ttt gaa cga age att ttt ccg 750 
Val Ser He Gly Asn Gly Lys Gly Val Phe Glu Arg Ser He Phe Pro 
45 50 55 

ggc aat gat tea gca ttt gtt aga ggc aca tea aat ttt aca ctt aca 798 
Gly Asn Asp Ser Ala Phe Val Arg Gly Thr Ser Asn Phe Thr Leu Thr 
60 65 70 

aat ctg gtt tea aga tat aat tea ggc ggc tat gca aca gtt gca ggc 846 
Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Ala Thr Val Ala Gly 
75 80 85 90 

cat aat caa gca ccg att ggc tea gca gtt tgc aga tea ggc tea aca 894 
His Asn Gin Ala Pro He Gly Ser Ala Val Cys Arg Ser Gly Ser Thr 
95 100 105 
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aca ggc tgg cat tgc ggc aca att caa gca aga aat caa aca gtt agg 942 
Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Asn Gin Thr Val Arg 
110 115 120 

tat ccg caa ggc aca gtt tat agt ctg aca aga aca aca gtt tgt gca 990 
Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr Thr Val Cys Ala 
125 130 " 135 

gaa ccg ggc gat tea ggc ggc tea tat att age ggc act caa gca caa 1038 
Glu Pro Gly Asp Ser Gly Gly Ser Tyr He Ser Gly Thr Gin Ala Gin 
140 145 150 

ggc gtt aca tea ggc ggc tea ggc aat tgc agt get ggc ggc aca aca 1086 
Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser Ala Gly Gly Thr Thr 
155 160 165 170 

tat tac caa gaa gtt aat ccg atg ctt agt tea tgg ggc ctt aca ctt 1134 
Tyr Tyr Gin Glu Val Asn Pro Met Leu Ser Ser Trp Gly Leu Thr Leu 
175 180 185 

aga aca caa teg cat gtt caa tec get cca 1164 
Arg Thr Gin Ser His Val Gin Ser Ala Pro 
190 195 



<210> 45 
<211> 388 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic Construct 
<400> 45 

Met Lys Lys Pro Leu Gly Lys He Val Ala Ser Thr Ala Leu Leu 
-190 -185 -180 



He Ser Val Ala Phe Ser Ser Ser He Ala Ser Ala Ala Thr Gly 
-175 -170 -165 



Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val Ser Met 
-160 -155 * -150 



Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu Ala 
-145 -140 -135 



Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
-130 -125 -120 



Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe 
-115 -110 -105 
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Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ala Ala 
-100 -95 -90 



Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Val Leu Val Ser Tyr 
-85 -80 -75 



Gly He Asp Gly Leu Asp Glu He Val Gin Glu Leu Asn Ala Ala Asp 
-70 -65 -60 -55 



Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala Gly Asp Thr 
-50 -45 -40 



Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Ser Gly Leu 
-35 -30 -25 



Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr Thr Ser 
-20 -15 -10 



Asp Gin Pro Glu Leu Tyr Ala Asp He He Gly Gly Leu Ala Tyr Tyr 
-5 -11 5 " 10 



Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn Ala Ser Gly 
15 20 25 



Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr Val Gly Thr Pro 
30 35 40 



Val Ser He Gly Asn Gly Lys Gly Val Phe Glu Arg Ser He Phe Pro 
45 50 55 



Gly Asn Asp Ser Ala Phe Val Arg Gly Thr Ser Asn Phe Thr Leu Thr 
60 65 70 



Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Ala Thr Val Ala Gly 
75 80 " ' ^ ^ 



His Asn Gin Ala Pro He Gly Ser Ala Val Cys Arg Ser Gly Ser Thr 
95 100 ~ 105 



Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Asn Gin Thr Val Arg 
110 115 ' 120 
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Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr Thr Val Cys Ala 
125 130 135 



Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie Ser Gly Thr Gin Ala Gin 
140 145 150 



Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser Ala Gly Gly Thr Thr 
155 160 165 J 170 



Tyr Tyr Gin Glu Val Asn Pro Met Leu Ser Ser Trp Gly Leu Thr Leu 
175 180 ~ 185 



Arg Thr Gin Ser His Val Gin Ser Ala Pro 
190 195 



<210> 46 

<211> 165 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide 0-2.19 



<220> 

<221> PROPEP 
<222> (1)..(165) 

<400> 46 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 
15 10 15 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu 
20 25 30 



Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
35 40 45 



Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe Asp 
50 55 60 



Thr Glu Ser Leu Thr Leu Thr Val Leu Val Thr Asp Ala Ser Ala Val 
65 70 75 80 



Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His Gly 
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85 90 95 



Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 



Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 
115 120 125 



Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Asp Ser Leu Leu 
130 135 140 



Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr Thr Ser Asp 
145 150 155 160 



Gin Pro Glu Leu Tyr 
165 



<210> 47 

<211> 166 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-2.73 



<220> 

<221> PROPEP 
<222> (1) . . (166) 

<400> 47 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 
1 5 10 15 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Ser Ser Ala Glu 
20 25 ^ 30 



Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
35 40 45 



Ala Ala Ala Gly Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe Asp 
50 55 60 



Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ser Ala Val 
65 70 75 80 
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Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His Gly 
85 90 95 



Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 



Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 
115 120 125 



Val Val Glu Val Leu Glu Gly Ser Gly Ala Asp Val Asp Ser Leu Leu 
130 135 " 140 



Ala Asp Ala Gly Val Asp Thr Ala Asp Val Lys Val Glu Ser Thr Thr 
145 150 155 160 



Glu Gin Pro Glu Leu Tyr 
165 



<210> 48 

<211> 166 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-1.43 



<220> 

<221> PROPEP 
<222> (1)..{166) 

<400> 48 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 
15 10 15 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser Ser Ser Gin 
20 25 ~ 30 



Ala Glu Glu Leu Leu Asp Ala Gin Ala Glu Ser Phe Glu He Asp Glu 
35 40 45 



Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser He Phe Asp 
50 55 60 



Thr Asp Ser Leu Thr Leu Thr Val Leu Val Thr Asp Ala Ser Ala Val 
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65 



70 



75 



80 



Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His Gly 
85 90 95 



Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 



Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 
115 120 125 



Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Asp Ser Leu Leu 
130 135 140 



Ala Asp Ala Gly Val Asp Thr Ala Asp Val Lys Val Glu Ser Thr Thr 
145 150 155 160 



Glu Gin Pro Glu Leu Tyr 
165 



<210> 49 
<211> 166 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-2.6 
<400> 49 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 
15 10 15 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu 
20 25 30 



Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
35 40 45 



Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser He Phe Asp 
50 55 60 



Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp Ser Ser Ser Val 
65 70 75 80 
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Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His Gly 
85 90 95 



Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 



Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 
115 120 125 



Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Asp Ser Leu Leu 
130 135 ~ 140 



Ala Gly Ala Gly Val Asp Thr Ala Asp Val Lys Val Glu Ser Thr Thr 
145 150 155 160 



Glu Gin Pro Glu Leu Tyr 
165 



<210> 50 

<211> 165 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-2.5 



<220> 

<221> PROPEP 
<222> (1)..(165) 

<400> 50 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 
15 10 15 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro Leu Glu 

20 25 30 



Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
35 40 45 



Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe Asp 
50 55 ^ 60 



Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ser Ala Val 
65 70 75 80 
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Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His Gly 
85 90 95 



Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 



Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 
115 120 125 



Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Asp Ser Leu Leu 
130 135 ~ 140 



Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr Pro Ala Ala 
145 150 155 160 



Arg Pro Glu Leu Tyr 
165 



<210> 51 

<211> 166 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-2.3 



<220> 

<221> PROPEP 
<222> (1)..(166) 

<400> 51 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Asp Gly Ala Glu Ala 
1 5 10 1 15 



Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro Ala 
20 25 30 



Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp 
35 40 45 



Glu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser He Phe 
50 55 60 
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Asp Thr Asp Ser Leu Thr Leu Thr Val Leu Val Thr Asp Ala Ala Ala 
65 70 75 80 



Val Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His 
85 90 95 



Gly Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp 
100 105 110 



Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala Gly Asp Thr 
115 120 125 



Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Tyr Ser Leu 
130 135 140 



Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr Pro Ala 
145 150 155 160 



Ala Gin Pro Glu Leu Tyr 
165 



<210> 52 

<211> 166 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-1.4 



<220> 

<221> PROPEP 
<222> (1)..(166) 

<400> 52 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 
15 10 15 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser Ser Ser Gin 
20 25 30 



Ala Glu Glu Leu Leu Asp Ala Gin Ala Glu Ser Phe Glu He Asp Glu 
35 40 45 



Ala Ala Ala Ala Ala Ala Ala Asp Ser Tyr Gly Gly Ser He Phe Asp 
50 55 ~ 60 
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Thr Asp Ser Leu Thr Leu Thr Val Leu Val Thr Asp Ala Ser Ala Val 
65 70 75 80 



Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His Gly 
85 90 95 



Met Glu Gly Leu Glu Glu lie Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 



Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 
115 120 125 



Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Asp Ser Leu Leu 
130 135 * 140 



Ala Asp Ala Gly Val Asp Thr Ala Asp Val Lys Val Glu Ser Thr Thr 
145 150 155 160 



Glu Gin Pro Glu Leu Tyr 
165 



<210> 53 

<211> 166 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-1.2 



<220> 

<221> PROPEP 
<222> (1)..(166) 

<400> 53 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 
15 10 15 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu 
20 25 30 



Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
35 40 45 
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Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser lie Phe Asp 
50 55 60 



Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp Ser Ser Ser Val 
65 70 75 80 



Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His Gly 
85 90 95 



Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 



Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 
115 120 125 



Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Asp Ser Leu Leu 
130 135 140 



Ala Gly Ala Gly Val Asp Thr Ala Asp Val Lys Val Glu Ser Thr Thr 
145 150 155 160 



Glu Gin Pro Glu Leu Tyr 
165 
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